Features for F0 contour prediction

Authors

  • Ted H. Applebaum
  • Nick Kibre
  • Steve Pearson
Abstract

Decision trees based on features derived from text analysis have previously been used to predict the input parameters of models of F0 contour for text-to-speech synthesis. Yet it is not known which features contribute most to the success of the prediction. This paper quantifies the dependence of the predicted F0 contour on each of several input features derived from the text. Parameters for the Tilt intonation model of F0 contour were predicted by decision trees trained on 6 simple features or 17 features derived from a rule-based front end. To evaluate the contribution of each input feature, F0 prediction error measures were first compared within a group of predictors where each predictor considered only a single input feature, and then within a second group of predictors where each predictor ignored one of the input features. F0 prediction error was measured on a new speaker by RMS deviation, mean absolute deviation and correlation. Similar trends were observed for each error measure. The features observed to most strongly affect F0 prediction were "position in the word of the following syllable", "percent of the way through a breath group", "presence of prosodic boundary at the end of the syllable" and "stress of the current syllable". These features are defined over different time scales and demonstrate how a local model of F0 contour can capture global properties.
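The evaluation protocol described in the abstract can be illustrated with a minimal sketch. The abstract does not give formulas, so the code below assumes the textbook definitions of RMS deviation, mean absolute deviation and Pearson correlation between predicted and observed F0 values. The function names and the feature names in the usage note are hypothetical, and the decision-tree training step is deliberately left out.

```python
import math


def rms_deviation(predicted, observed):
    """Root-mean-square deviation between two equal-length F0 sequences."""
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared) / len(squared))


def mean_absolute_deviation(predicted, observed):
    """Mean absolute deviation between two equal-length F0 sequences."""
    absolute = [abs(p - o) for p, o in zip(predicted, observed)]
    return sum(absolute) / len(absolute)


def correlation(predicted, observed):
    """Pearson correlation coefficient (assumed definition, not from the paper)."""
    n = len(observed)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    return cov / math.sqrt(var_p * var_o)


def ablation_feature_sets(features):
    """Build the two groups of feature subsets compared in the abstract.

    Returns (single, leave_one_out): one subset per feature containing only
    that feature, and one subset per feature containing all the others.
    """
    single = [[f] for f in features]
    leave_one_out = [[g for g in features if g != f] for f in features]
    return single, leave_one_out
```

In use, each subset returned by `ablation_feature_sets` (for example over hypothetical names such as `["stress", "boundary", "breath_group_position"]`) would be used to train one predictor, and the three measures would then be computed on held-out data from a new speaker. A feature matters more when the single-feature predictor has low error and the predictor that omits it has high error.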

Similar resources

Corpus-based generation of prosodic features from text based on generation process model

A total scheme of generating prosodic features from a text input was constructed. The method consists of corpus-based prediction of pauses, phone durations and fundamental frequencies (F0's), in this order, and information predicted in an earlier process is utilized in the following processes. Since prediction of F0's is done on the command values of F0 contour generation process model instead ...

Sankar Mukherjee

In text-to-speech synthesis systems, the F0 contour plays an important role in conveying prosodic information, but the process of synthesizing the F0 contour from the underlying linguistic information using a deep architecture has not been investigated for the Bengali language. This paper describes a method for synthesizing F0 contours of Bengali readout speech from the textual features of input text us...

Generation of F0 contour using stochastic mapping and vector quantization control parameters

This paper introduces an F0 contour generation method for text-to-speech synthesis using stochastic mapping and vector quantization control parameters. This model uses a new F0 contour labelling scheme based on the RFC (Rise/Fall/Connection) model [1], which describes F0 contour patterns with seven F0 labels and three pause labels. This paper also suggests an efficient selection method for contro...

A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using k-toBI system

This study describes an algorithm for the F0 contour generation system for Korean sentences and its evaluation results. 400 K-ToBI labeled utterances, read by one male and one female announcer, were used. The F0 contour generation system uses two classification trees for prediction of K-ToBI labels for input text and 11 regression trees for prediction of F0 values for the labels. Evaluatio...

Word-level F0 modeling in the automated assessment of non-native read speech

This study investigates methods for automatically evaluating the appropriateness of F0 contours in the task of automated assessment of non-native read aloud speech. The F0 contour of a test taker’s spoken response is represented as a fixed-dimension vector with a word-level F0 value corresponding to each word in the prompt text. This vector is then correlated with gold standard vectors extracte...

Publication date: 2000